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DETAILED ACTION 

1. Claims 1-19 have been considered. Claims 6-7 have been amended as per Applicants' 
request. 

Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: Amendment as filed 18 My 2007. 

Claim Rejections - 35 USC § 101 

3. Claims 5 and 14 are rejected under 35 U.S.C. 101 because the claimed recitation of a 
use, without setting forth any steps involved in the process, results in an improper definition of a 
process, i.e., results in a claim which is not a proper process claim under 35 U.S.C. 101. See for 
example Ex parte Dunki, 153 USPQ 678 (Bd.App. 1967) and Clinical Products, Ltd. v. Brenner, 
255 F. Supp. 131, 149 USPQ 475 (D.D.C. 1966). 

4. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or 
any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and 
requirements of this title. 

5. Claim 14 is rejected under 35 U.S.C. 101 because the claimed invention is directed to 
non-statutory subject matter. The claim is drawn to a processing engine with a front end stage 
and an execution unit without any clear language with regards to whether the processing engine 
is implemented in pure hardware, pure software, or a combination of both. If the processing 
engine is implemented in pure software, the claim would be drawn to software per se, which is 
non-statutory subject matter. 

Claim Rejections - 35 USC § 112 

6. The following is a quotation of the second paragraph of 35 U.S.C. 112: 
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The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

7. Claims 5 and 14 rejected under 35 U.S.C. 112, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. 

8. Claims 5 and 14 provide for the use of instructions in reverse program order, but, since 
the claim does not set forth any steps involved in the method/process, it is unclear what 
method/process applicant is intending to encompass. A claim is indefinite where it merely 
recites a use without any active, positive steps delimiting how this use is actually practiced. 

Claim Rejections - 35 USC § 102 

9. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

10. Claims 1-5 and 8-14 are rejected under 35 U.S.C. 102(e) as being taught by Kyker et al., 
U.S. Patent Number 6,578,138 (herein referred to as Kyker). 

1 1 . Referring to claim 1 , Kyker has taught a cache comprising: 

a. A cache line to store an instruction segment further comprising a plurality of 

instructions stored in sequential positions of cache line in reverse program order 

(Kyker Abstract ". . .a cache unit, which includes a data array that stores 

traces. . .In one exemplary method of unrolling loops, the processor or trace cache 
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unrolls loops. . ."; column 2, line 59 to column 3, line 33 ". . .when the trace cache 
determines that a loop is present, the trace cache continues to build the trace by 
building additional iterations of the loop until the trace is a minimal length. . .In 
other words, the trace cache builds the loop repeatedly until the trace is, for 
example, over two trace lines long. . ."; Figure 1 ; and Figure 2 - In regards to 
Kyker, as shown in Figure 2 and explained the in the cited lines, the loop has 
three main instructions and, when the backwards branch is encountered, the 
program jumps backwards, i.e. reverses directions, to the beginning of the loop 
instead of moving forward in the program. As shown in Figure 2, when L H is 
encountered, previously stored instructions L 2 and L 3 are stored again in 
consecutive lines in the cache, e.g. the trace cache reverses the program direction 
and stores previously stored instructions again in the trace cache when a loop is 
encountered.). 

12. Referring to claim 2, Kyker has taught the cache of claim 1 , wherein the instruction 
segment is an extended block (Kyker column 1, lines 30-45 "...the target address, the backward 
taken branch, and any micro-ops between the two form a loop..."; column 2, line 59 to column 
3, line 33 "...Exemplary trace Tl includes a total of four micro-ops... The second exemplary 
trace T2 includes the same loop, L H , L/, L 2 , but does not include any micro-op preceding the 
loop itself..."; Figure 1; and Figure 2 - In regards to Kyker, the loop contains a plurality of 
instructions, i.e. block of instructions, with four instructions and includes the extra micro-op 
preceding the loop itself). 
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13. Referring to claim 3, Kyker has taught the cache of claim 1 , wherein the instruction 
segment is a trace (Kyker Abstract ". . .a cache unit, which includes a data array that stores 
traces. . .In one exemplary method of unrolling loops, the processor or trace cache unrolls 
loops. . ."; column 2, line 59 to column 3, line 33 ". . .when the trace cache determines that a loop 
is present, the trace cache continues to build the trace by building additional iterations of the loop 
until the trace is a minimal length.. Jn other words, the trace cache builds the loop repeatedly 
until the trace is, for example, over two trace lines long. . ."; Figure 1 ; and Figure 2). 

14. Referring to claim 4, Kyker has taught the cache of claim 1, wherein the instruction 
segment is a basic block (Kyker column 1, lines 30-45 "...the target address, the backward taken 
branch, and any micro-ops between the two form a loop. . ."; column 2, line 59 to column 3, line 
33 "...Exemplary trace Tl includes a total of four micro-ops... The second exemplary trace T2 
includes the same loop, L//, L h L 2 , but does not include any micro-op preceding the loop 
itself. . ."; Figure 1 ; and Figure 2 - In regards to Kyker, the loop contains a plurality of 
instructions, i.e. block of instructions, with three instructions and does not include the extra 
micro-op preceding the loop itself.). 

15. Referring to claim 5, Kyker has taught a segment cache for a front-end system in a 
processor (Kyker column 5, line 45 to column 6, line 2 "...The trace cache also includes, for 
example, a control bloc 109 and a instruction decoder 111..."; Figure 10; and Figure 1 1 - In 
regards to Kyker, the trace cache stores portions, e.g. segments, of the program and, as shown in 
Figures 10 and 1 1, is part of the system associated with decoding instructions, which is prior to 
execution and retirement of instructions, i.e. the back-end system.), comprising a plurality of 
cache entries to store instructions of instruction segments in reverse program order (Kyker 
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Abstract ". . .a cache unit, which includes a data array that stores traces. . .In one exemplary 
method of unrolling loops, the processor or trace cache unrolls loops..."; column 2, line 59 to 
column 3, line 33 "...when the trace cache determines that a loop is present, the trace cache 
continues to build the trace by building additional iterations of the loop until the trace is a 
minimal length... In other words, the trace cache builds the loop repeatedly until the trace is, for 
example, over two trace lines long. . ."; Figure 1 ; and Figure 2 - In regards to Kyker, as shown in 
Figure 2 and explained the in the cited lines, the loop has three main instructions and, when the 
backwards branch is encountered, the program jumps backwards, i.e. reverses directions, to the 
beginning of the loop instead of moving forward in the program. As shown in Figure 2, when L H 
is encountered, previously stored instructions L 2 and L 3 are stored again in consecutive lines in 
the cache, e.g. the trace cache reverses the program direction and stores previously stored 
instructions again in the trace cache when a loop is encountered.). 
1 6. Referring to claim 8, Kyker has taught a method comprising: 

a. Building an instruction segment based on program flow (Kyker Abstract ". . .a 
cache unit, which includes a data array that stores traces. . .In one exemplary 
method of unrolling loops, the processor or trace cache unrolls loops. column 
2, line 59 to column 3, line 33 ". . .when the trace cache determines that a loop is 
present, the trace cache continues to build the trace by building additional 
iterations of the loop until the trace is a minimal length. . .In other words, the trace 
cache builds the loop repeatedly until the trace is, for example, over two trace 
lines long. ; ."; Figure 1 ; and Figure 2), and 
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b. Storing instructions of the instruction segment in a cache entry in reverse program 
order (Kyker Abstract . .a cache unit, which includes a data array that stores 
traces. . .In one exemplary method of unrolling loops, the processor or trace cache 
unrolls loops.,."; column 2, line 59 to column 3, line 33 "...when the trace cache 
determines that a loop is present, the trace cache continues to build the trace by 
building additional iterations of the loop until the trace is a minimal length... In 
other words, the trace cache builds the loop repeatedly until the trace is, for 
example, over two trace lines long. . ."; Figure 1 ; and Figure 2 - In regards to 
Kyker, as shown in Figure 2 and explained the in the cited lines, the loop has 
three main instructions and, when the backwards branch is encountered, the 
program jumps backwards, i.e. reverses directions, to the beginning of the loop 
instead of moving forward in the program. As shown in Figure 2, when Lh is 
encountered, previously stored instructions L2 and L3 are stored again in 
consecutive lines in the cache, e.g. the trace cache reverses the program direction 
and stores previously stored instructions again in the trace cache when a loop is 
encountered.). 

17. Referring to claim 9, Kyker has taught the method of claim 8, further comprising: 

a. Building a second instruction segment based on program flow, and if the first and 
second instruction segments, overlap, extending the first instruction segment to 
include non-overlapping instructions from the second instruction segment (Kyker 
Abstract ". . .a cache unit, which includes a data array that stores traces. . .In one 
exemplary method of unrolling loops, the processor or trace cache unrolls 
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loops. . ."; column 2, line 59 to column 3, line 33 . . when the trace cache 
determines that a loop is present, the trace cache continues to build the trace by 
building additional iterations of the loop until the trace is a minimal length... In 
other words, the trace cache builds the loop repeatedly until the trace is, for 
example, over two trace lines long. . ."; Figure 1 ; and Figure 2 - In regards to 
Kyker, as shown in Figure 1, the trace is built as normal, e.g. a first instruction 
segment is built normally, when there is no backward-taken branch. When a 
backward branch is taken to form another, second instruction segment, another 
iteration of the loop is added to the trace, thereby extending the first segment to 
include the first segment.). 
18. Referring to claim 10, Kyker has taught the method of claim 9, wherein the extending 
comprises storing the non-overlapping instructions in the cache in reverse program order in 
successive cache positions adjacent to the instructions from the first instruction segment (Kyker 
Abstract ". . .a cache unit, which includes a data array that stores traces. . .In one exemplary 
method of unrolling loops, the processor or trace cache unrolls loops..."; column 2, line 59 to 
column 3, line 33 ". . .when the trace cache determines that a loop is present, the trace cache 
continues to build the trace by building additional iterations of the loop until the trace is a 
minimal length. ..In other words, the trace cache builds the loop repeatedly until the trace is, for 
example, over two trace lines long..."; Figure 1; and Figure 2 - In regards to Kyker, as shown in 
Figure 2 and explained the in the cited lines, the loop has three main instructions and, when the 
backwards branch is encountered, the program jumps backwards, i.e. reverses directions, to the 
beginning of the loop instead of moving forward in the program. As shown in Figure 2, when L H 
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is encountered, previously stored instructions L 2 and L 3 are stored again in consecutive lines in 
the cache, e.g. the trace cache reverses the program direction and stores previously stored 
instructions again in the trace cache when a loop is encountered.). 

19. Referring to claim 11, Kyker has taught the method of claim 8, wherein the instruction 
segment is an extended block (Kyker column 1, lines 30-45 "...the target address, the backward 
taken branch, and any micro-ops between the two form a loop. . ."; column 2, line 59 to column 
3, line 33 ". . .Exemplary trace Tl includes a total of four micro-ops. . .The second exemplary 
trace T2 includes the same loop, L H , L/, L 2 , but does not include any micro-op preceding the 
loop itself. . ."; Figure 1 ; and Figure 2 - In regards to Kyker, the loop contains a plurality of 
instructions, i.e. block of instructions, with four instructions and includes the extra micro-op 
preceding the loop itself.). 

20. Referring to claim 12, Kyker has taught the method of claim 8, wherein the instruction 
segment is a trace (Kyker Abstract ". . .a cache unit, which includes a data array that stores 
traces. . .In one exemplary method of unrolling loops, the processor or trace cache unrolls 
loops..."; column 2, line 59 to column 3, line 33 "...when the trace cache determines that a loop 
is present, the trace cache continues to build the trace by building additional iterations of the loop 
until the trace is a minimal length... In other words, the trace cache builds the loop repeatedly 
until the trace is, for example, over two trace lines long. . ."; Figure 1 ; and Figure 2). 

21. Referring to claim 13, Kyker has taught the method of claim 8, wherein the instruction 
segment is a basic block (Kyker column 1, lines 30-45 "...the target address, the backward taken 
branch, and any micro-ops between the two form a loop..."; column 2, line 59 to column 3, line 
33 . .Exemplary trace Tl includes a total of four micro-ops. . .The second exemplary trace T2 
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includes the same loop, L H , L/, L 2 , but does not include any micro-op preceding the loop 
itself. . ."; Figure 1 ; and Figure 2 - In regards to Kyker, the loop contains a plurality of 
instructions, i.e. block of instructions, with three instructions and does not include the extra 
micro-op preceding the loop itself.). 

22. Referring to claim 14, Kyker has taught a processing engine, comprising: 

a. A front end stage to build and store instruction segments (Kyker column 5, line 45 
to column 6, line 2 "...The trace cache also includes, for example, a control bloc 
109 and a instruction decoder 111..."; Figure 10; and Figure 1 1 - In regards to 
Kyker, the trace cache stores portions, e.g. segments, of the program and, as 
shown in Figures 10 and 1 1, is part of the system associated with decoding 
instructions, which is prior to execution and retirement of instructions, i.e. the 
back-end system.), instructions provided therein in reverse program order (Kyker 
Abstract . .a cache unit, which includes a data array that stores traces. . .In one 
exemplary method of unrolling loops, the processor or trace cache unrolls 
loops..."; column 2, line 59 to column 3, line 33 "...when the trace cache 
determines that a loop is present, the trace cache continues to build the trace by 
building additional iterations of the loop until the trace is a minimal length. . .In 
other words, the trace cache builds the loop repeatedly until the trace is, for 
example^ over two trace lines long. . ."; Figure 1 ; and Figure 2 - In regards to 
Kyker, as shown in Figure 2 and explained the in the cited lines, the loop has 
three main instructions and, when the backwards branch is encountered, the 
program jumps backwards, i.e. reverses directions, to the beginning of the loop 
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instead of moving forward in the program. As shown in Figure 2, when Lh is 
encountered, previously stored instructions L 2 and L 3 are stored again in 
consecutive lines in the cache, e.g. the trace cache reverses the program direction 
and stores previously stored instructions again in the trace cache when a loop is 
encountered.), and 

b. An execution unit in communication with the front end stage (Kyker column 5, 
line 61 to column 6, line 2 "...the computer system includes a processor 121, a 
trace cache 101, and a computer memory 123. . ." and Figure 11). 
Claim Rejections - 35 USC § 103 

23. Claims 6-7 and 15-19 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kyker et al., U.S. Patent Number 6,578,138 (herein referred to as Kyker) as applied to claims 5 
and 14 above, and further in view of Rotenberg et al.'s "A Trace Cache Microarchitecture and 
Evaluation" IEEE ©1999 (herein referred to as Rotenberg). 

24. Referring to claim 6, Kyker has taught apparatus comprising; 

a. An instruction cache system (Kyker column 5, lines 45-60 . .an exemplary 
embodiment of a trace cache. . ." and Figure 10), 

b. An instruction segment system (Kyker column 5, line 45 to column 6, line 2 
". . .The trace cache also includes, for example, a control block 109 and a 
instruction decoder 111..."; Figure 10; and Figure 1 1 - In regards to Kyker, the 
trace cache stores portions, e.g. segments, of the program.), comprising: 

i. A fill unit provided in communication with the instruction cache system 

(Kyker column 5, lines 45-60 ". . .the control block 109 facilitates the build 
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process by providing write enable commands to the data array. . ." and 
Figure 10), 

ii. The segment cache of claim 5 included therein (See the above rejection of 
claim 5), and 

25. Kyker has not taught a selector coupled to an output of the instruction cache system and 
to an output of the segment cache. Rotenberg has taught a selector coupled to an output of the 
instruction cache system and to an output of the segment cache (Rotenberg Section 2.1 Trace- 
Level Sequencing ". . .The output of the trace cache is one or more traces. . ."; Section 2.2 
Instruction-Level Sequencing "The outstanding trace buffers in Fig. 2 are used to 1) construct 
new traces that are not in the trace cache and 2) track branch outcomes. . ."; and Figure 2 - In 
regards to Rotenberg, Figure 2 shows the output of the trace cache and the outstanding trace 
buffers are connected to a line that has two inputs and one output, which is a selector. The 
selector chooses between and existing trace in the trace cache, i.e. a trace cache hit, or a newly 
formed trace, i.e. a trace cache miss or misprediction.) A person of ordinary skill in the art at the 
time the invention was made, and as taught by Rotenberg, would have recognized that the trace 
cache system provides fast trace-level sequencing while providing a method to create nonexistent 
traces or repair mispredicted traces (Rotenberg Section 2.1 Trace-Level Sequencing "...The trace 
predictor and trace cache together provide fast trace-level sequencing.. ..Instruction-level 
sequencing, discussed in the next section, is required to construct nonexistent traces or repair 
trace mispredictions."). Therefore, it would have been obvious to a person of ordinary skill in 
the art at the time the invention was made to incorporate the trace caches, buffers, and selector of 
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Rotenberg in the device of Kyker to increase the speed of trace-level sequencing while using 
instruction-level sequencing to create new or correct mispredicted traces. 

26. Referring to claim 7, Kyker has not taught an apparatus of claim 6, wherein the 
instruction segment system further comprises a segment predictor provided in communication 
with the segment cache. Rotenberg has taught an apparatus of claim 6, wherein the instruction 
segment system further comprises a segment predictor provided in communication with the 
segment cache (Rotenberg Section 2.1 Trace-Level Sequencing "...A next trace predictor[\4] 
treats traces as basic units and explicitly predicts sequences of traces.. ." and Figures 2). A 
person of ordinary skill in the art at the time the invention was made, and as taught by 
Rotenberg, would have recognized that the trace predictor achieves high branch prediction 
throughput with a single prediction per cycle (Rotenberg Section 2.1 Trace-Level Sequencing 
"...high branch prediction throughput is implicitly achieved with only a single trace prediction 
per cycle. . ."). Therefore, it would have been obvious to a person of ordinary skill in the art at 
the time the invention was made to incorporate the trace predictor of Rotenberg in the device of 
Kyker to achieve high branch prediction throughput in a single prediction cycle. 

27. Referring to claim 15 5 Kyker has taught the processing engine of claim 14, wherein the 
front-end stage comprises: 

a. An instruction cache system (Kyker column 5, lines 45-60 . .an exemplary 
embodiment of a trace cache..." and Figure 10), 

b. An instruction segment system (Kyker column 5, line 45 to column 6, line 2 
"...The trace cache also includes, for example, a control bloc 109 and a 



Application/Control Number: 09/708,722 Page 14 

Art Unit: 2183 

instruction decoder 111..."; Figure 10; and Figure 1 1 - In regards to Kyker, the 
trace cache stores portions, e.g. segments, of the program.); comprising: 

i. A fill unit provided in communication with the instruction cache system 
(Kyker column 5, lines 45-60 "...the control block 109 facilitates the build 
process by providing write enable commands to the data array. . ." and 
Figure 10), 

ii. A segment cache (Kyker column 5, line 45 to column 6, line 2 ". . .The 
trace cache also includes, for example, a control bloc 109 and a instruction 
decoder 111..."; Figure 10; and Figure 1 1 - In regards to Kyker, the trace 
cache stores portions, e.g. segments, of the program.), and 

28. Kyker has not taught a. selector coupled to an output of the instruction cache system and 
to an output of the segment cache. Rotenberg has taught a selector coupled to an output of the 
instruction cache system and to an output of the segment cache (Rotenberg Section 2.1 Trace- 
Level Sequencing . .The output of the trace cache is one or more traces. . ."; Section 2.2 
Instruction-Level Sequencing "The outstanding trace buffers in Fig. 2 are used to 1) construct 
new traces that are not in the trace cache and 2) track branch outcomes. . ."; and Figure 2 - In 
regards to Rotenberg, Figure 2 shows the output of the trace cache and the outstanding trace 
buffers are connected to a line that has two inputs and one output, which is a selector. The 
selector chooses between and existing trace in the trace cache, i.e. a trace cache hit, or a newly 
formed trace, i.e. a trace cache miss or misprediction.) A person of ordinary skill in the art at the 
time the invention was made, and as taught by Rotenberg, would have recognized that the trace 
cache system provides fast trace-level sequencing while providing a method to create nonexistent 
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traces or repair mispredicted traces (Rotenberg Section 2.1 Trace-Level Sequencing "...The trace 
predictor and trace cache together provide fast trace-level sequencing... Instruction-level 
sequencing, discussed in the next section, is required to construct nonexistent traces or repair 
trace mispredictions."). Therefore, it would have been obvious to a person of ordinary skill in 
the art at the time the invention was made to incorporate the trace caches, buffers, and selector of 
Rotenberg in the device of Kyker to increase the speed of trace-level sequencing while using 
instruction-level sequencing to create new or correct mispredicted traces. 

29. Referring to claim 16, Kyker in view of Rotenberg has taught the processing engine of 
claim 15, wherein the instruction segments are extended blocks (Kyker column 1, lines 30-45 

. .the target address, the backward taken branch, and any micro-ops between the two form a 
loop..."; column 2, line 59 to column 3, line 33 "...Exemplary trace Tl includes a total of four 
micro-ops. . .The second exemplary trace T2 includes the same loop, L H , Ly, L 2 , but does not 
include any micro-op preceding the loop itself..."; Figure 1; and Figure 2 - In regards to Kyker, 
the loop contains a plurality of instructions, i.e. block of instructions, with four instructions and 
includes the extra micro-op preceding the loop itself.). 

30. Referring to claim 1 7, Kyker in view of Rotenberg the processing engine of claim of 15, 
wherein the instruction segments are traces (Kyker Abstract ..a cache unit, which includes a 
data array that stores traces. . .In one exemplary method of unrolling loops, the processor or trace 
cache unrolls loops. . ."; column 2, line 59 to column 3, line 33 ". . . when the trace cache 
determines that a loop is present, the trace cache continues to build the trace by building . 
additional iterations of the loop until the trace is a minimal length... In other words, the trace 
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cache builds the loop repeatedly until the trace is, for example, over two trace lines long..."; 
Figure 1 ; and Figure 2). 

3 1 . Referring to claim 1 8, Kyker in view of Rotenberg has taught the processing engine of 
claim 15, wherein the instruction segments are basic blocks (Kyker column 1, lines 30-45 ..the 
target address, the backward taken branch, and any micro-ops between the two form a loop. . ."; 
column 2, line 59 to column 3, line 33 "...Exemplary trace Tl includes a total of four micro- 
ops.. .The second exemplary trace T2 includes the same loop, L H , L/, L 2 , but does not include 
any micro-op preceding the loop itself. . ."; Figure 1 ; and Figure 2 - In regards to Kyker, the loop 
contains a plurality of instructions, i.e. block of instructions, with three instructions and does not 
include the extra micro-op preceding the loop itself). 

32. Referring to claim 19, Kyker has not taught the processing engine of claim 15, wherein 
the instruction segment cache system further comprises a segment predictor provided in 
communication with the segment cache. Rotenberg has taught the processing engine of claim 
15, wherein the instruction segment cache system further comprises a segment predictor 
provided in communication with the segment cache (Rotenberg Section 2.1 Trace-Level 
Sequencing "...A next trace predictor[\4] treats traces as basic units and explicitly predicts 
sequences of traces. . ." and Figures 2). A person of ordinary skill in the art at the time the 
invention was made, and as taught by Rotenberg, would have recognized that the trace predictor 
achieves high branch prediction throughput with a single prediction per cycle (Rotenberg Section 
2.1 Trace-Level Sequencing "...high branch prediction throughput is implicitly achieved with 
only a single trace prediction per cycle. . .")• Therefore, it would have been obvious to a person 
of ordinary skill in the art at the time the invention was made to incorporate the trace predictor of 
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Rotenberg in the device of Kyker to achieve high branch prediction throughput in a single 
prediction cycle. 

33. Examiner notes that the previous rejection of the claims under 35 U.S.C. §103 has been 
reintroduced. The rejection had been previously withdrawn due to the arguments presented in 
the Appeal Brief filed 20 December 2006 taken in light of the Federal Circuit Court's ruling on 
Teleflex. However, the subsequent ruling from the U.S. Supreme Court on KSR has forced the 
Examiner to re-evaluate that interpretation of the arguments and prior art rejections. 

34. Claims 1, 3-8, 12-15 and 17-19 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Patel et al, Improving Trace Cache Effectiveness with Branch Promotion and Trace 
Packing, in further view of Johnson, U.S. Patent No. 5,924,092. 

35. Regarding claim 1, Patel has taught a cache comprising a cache line to store an 
instruction segment further comprising a plurality of instructions stored in sequential positions of 
the cache line (see Col.l line 26 - Col.2 line 15). Patel has not explicitly taught storing the 
plurality of instructions in sequential positions of a cache line in reverse program order. 

36. However, Johnson has taught the storing of blocks of data in reverse order so that those 
blocks that were in the first block of a data structure that are frequently accessed and modified 
will require less moving and fewer modifications after being placed at the last location of data 
structure, the fewer modifications resulting in improved performance (see Johnson, Col. 4 lines 
13-24). Because the traces of Patel are indexed, as well as accessed and modified, via their head 
(first) instructions (see Patel, Col.4 lines 23-31 and Col. 5 lines 12-18), one of ordinary skill in 
the art would have found it obvious to modify the instruction segment of Patel to store the 
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instructions of the instruction trace in reverse order so that the frequently accessed and modified 
head of the trace will be moved and modified fewer times so that performance is improved. 

37. Regarding claim 3, Patel in view of Johnson has taught the cache of claim 1 , wherein the 
instruction segment is a trace (see Patel, Col.3 lines 2-12). 

38. Regarding claim 4, Patel in view of Johnson has taught the cache of claim 1 , wherein the 
instruction segment is a basic block (see Patel, Col.2 lines 2-5). 

39. Regarding claim 5, Patel has taught a segment cache (see "trace cache" of Fig. 1) for a 
front-end system in a processor, comprising a plurality of cache entries to store instruction 
segments (see Col.l line 26 - Col.2 line 15). Patel has not explicitly taught storing the 
instruction segments in reverse program order. 

40. However, Johnson has taught the storing of blocks of data in reverse order so that those 
blocks that were in the first block of a data structure that are frequently accessed and modified 
will require less moving and fewer modifications after being placed at the last location of data 
structure, the fewer modifications resulting in improved performance (see Johnson, Col.4 lines 
13-24). Because the traces of Patel are indexed, as well as accessed and modified, via their head 
(first) instructions (see Patel, Col.4 lines 23-31 and Col.5 lines 12-18), one of ordinary skill in 
the art would have found it obvious to modify the instruction segment of Patel to store the 
instructions of the instruction trace in reverse order so that the frequently accessed and modified 
head of the trace will be moved and modified fewer times so that performance is improved. 

41 . Regarding claim 6, Patel in view of Johnson has taught an apparatus comprising: 

a. An instruction cache system (see Patel, "instruction cache" of Fig. 1), 

b. An instruction segment system, comprising: 
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i. A fill unit (see Patel, "fill unit" of Fig. 1) provided in communication with 
the instruction cache system, the segment cache of claim 5 included 
therein (see Patel, Fig. 1 ), 
c. A selector (see Patel, "selection logic" of Fig. 1) coupled to an output of the 

instruction cache system and to an output of the segment cache (see Patel, Fig.l). 

42. Regarding claim 7, Patel in view of Johnson has taught the front-end system of claim 6, 
wherein the instruction segment system further comprises a segment predictor (see Patel, 
"multiple branch predictor" of Fig.l) provided in communication with the segment cache. Here, 
when the multiple branch predictor is coupled with the trace cache and mediated by the selection 
logic, it effectively predicts segments because the basic blocks stored in the trace cache all begin 
with branches (see Patel, Col.5 lines 5-29). 

43. Regarding claim 8, Patel has taught a method for storing instruction segments in a 
processor, comprising: 

a. Building an instruction segment based on program flow (see Col.l line 26 - Col.2 
line 15), 

b. Storing the instruction segment in a cache (see Col. 1 line 26 - Col.2 line 1 5). 

44. Patel has not explicitly taught wherein the instruction segment is stored in reverse 
program order. 

45. However, Johnson has taught the storing of blocks of data in reverse order so that those 
blocks that were in the first block of a data structure that are frequently accessed and modified 
will require less moving and fewer modifications after being placed at the last location of data 
structure, the fewer modifications resulting in improved performance (see Johnson, Col.4 lines 
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13-24). Because the traces of Patel are indexed, as well as accessed and modified, via their head 
(first) instructions (see Patel, Col.4 lines 23-31 and Col. 5 lines 12-18), one of ordinary skill in 
the art would have found it obvious to modify the instruction segment of Patel to store the 
instructions of the instruction trace in reverse order so that the frequently accessed and modified 
head of the trace will be moved and modified fewer times so that performance is improved. 

46. Regarding claim 12, Patel in view of Johnson has taught the method of claim 8, wherein 
the instruction segment is a trace (see Patel, Col. 3 lines 2-12). 

47. Regarding claim 13, Patel in view of Johnson has taught the method of claim 8, wherein 
the instruction segment is a basic block (see Patel, Col.2 lines 2-5). 

48. Regarding claim 14, Patel has taught a processing engine, comprising: 

a. A front-end stage to build and store instruction segments (see Col.l line 26 - 
Col.2 line 15), 

b. An execution unit (see "HPS Execution Core" in Fig. 1 ) in communication with 
the front end stage (see CoL5 line 30 - Col. 6 line 4). 

49. Patel has not explicitly taught building and storing the instruction segments in reverse 
program order. 

50. However, Johnson has taught the storing of blocks of data in reverse order so that those 
blocks that were in the first block of a data structure that are frequently accessed and modified 
will require less moving and fewer modifications after being placed at the last location of data 
structure, the fewer modifications resulting in improved performance (see Johnson, Col.4 lines 
13-24). Because the traces of Patel are indexed, as well as accessed and modified, via their head 
(first) instructions (see Patel, Col.4 lines 23-31 and Col.5 lines 12-18), one of ordinary skill in 
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the art would have found it obvious to modify the instruction segment of Patel to store the 
instructions of the instruction trace in reverse order so that the frequently accessed and modified 
head of the trace will be moved and modified, fewer times so that performance is improved. 

5 1 . Regarding claim 15, Patel in view of Johnson has taught the processing engine of claim 
14, wherein the front-end stage comprises: 

a. An instruction cache system (see Patel, "instruction cache" of Fig. 1 ), 

b. An instruction segment system, comprising: 

i. A fill unit (see Patel, "fill unit" of Fig.l) provided in communication with 
the instruction cache system (see Patel, Fig.l), 

ii. A segment cache (see "trace cache" of Fig.l), 

c. A selector (see Patel, "selection logic" of Fig.l) coupled to an output of the 
instruction cache system and to an output of the segment cache (see Patel, Fig. 1). 

52. Regarding claim 1 7, Patel in view of Johnson has taught the method of claim 1 5, wherein 
the instruction segments are traces (see Patel, Col.3 lines 2-12). 

53. Regarding claim 18, Patel in view of Johnson has taught the method of claim 15, wherein 
the instruction segments are basic blocks (see Patel, Col.2 lines 2-5). 

54. Regarding claim 19, Patel in view of Johnson has taught the method of claim 1 5, wherein 
the instruction segment cache system further comprises a segment predictor (see Patel, "multiple 
branch predictor of Fig.l) provided in communication with the segment cache. Here, when the 
multiple branch predictor is coupled with the trace cache and mediated by the selection logic, it 
effectively predicts segments because the basic blocks stored in the trace cache all begin with 
branches (see Patel, Col.5 lines 5-29). 
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55. Claims 2, 9-1 1 and 16 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Patel et al., Improving Trace Cache Effectiveness with Branch Promotion and Trace Packing, in 
view of Johnson, U.S. Patent No. 5,924,092, in further view of Peled et al., U.S. Patent No. 
6,076,144. 

56. Regarding claim 2, Patel in view of Johnson has taught the cache of claim 1, but has not 
explicitly taught wherein the instruction segment is an extended block. 

57. However, Peled has taught trace segments which have multiple entry points and a single 
exit that allow redundant code segments to be eliminated from the trace cache, thereby 
improving cache utilization (see Peled, Col.l lines 60-63, Col.4 lines 13-37, and Fig.3). Because 
the specification has defined an extended block to have multiple entry points and a single exit 
point (see p.2 of Specification), one of ordinary skill in the art would have found it obvious to 
modify the instruction segments of Patel to allow for multiple entry points and a single exit so 
that redundant code segments could be eliminated from the trace cache and performance could 
be improved. 

58. Regarding claim 9, Patel in view of Johnson have taught the method of claim 9, but have 
not explicitly taught wherein the method further comprises: 

a. Building a second instruction segment based on program flow, 

b. If the first and second instruction segments overlap, extending the first instruction 
segment to include non-overlapping instructions from the second instruction 
segment. 

59. However, Peled has taught building a second instruction segment based on program flow 
and subsequently extending the first instruction segment to include the non-overlapping 



Application/Control Number: 09/708,722 Page 23 

Art Unit: 2183 

instructions from the second instruction segment if the two segments overlap (see Col.4 lines 13- 
37) in order to reduce the degree of code redundancy in the trace cache (see Col.l lines 60-63). 
One of ordinary skill in the art would have recognized that it is desirable to reduce redundancy 
within a trace cache so that the cache can be more effectively used and more different traces 
stored. Therefore, one of ordinary skill in the art would have found it obvious to extend an 
existing instruction segment to include non-overlapping instructions from a second instruction 
segment in order to reduce trace cache redundancy. 

60. Regarding claim 1 0, Patel in view of Johnson in further view of Peled has taught the 
method of claim 9, but has not explicitly taught wherein the extending comprises storing the non- 
overlapping instructions in' the cache in reverse program order in successive cache positions 
adjacent to the instructions from the first instruction segment. 

61 . However, Patel in view of Johnson has taught that instructions in instruction segments are 
stored in reverse program order (see paragraphs 21-23 above). Because, an extended segment is 
still an instruction segment, one of ordinary skill in the art would have found it obvious to also 
store the extended instruction segments in reverse program order. 

62. Regarding claim 1 1 , Patel in view of Johnson has taught the method of claim 8, but has 
not explicitly taught wherein the instruction segment is an extended block. 

63. However, Peled has taught trace segments which have multiple entry points and a single 
exit that allow redundant code segments to be eliminated from the trace cache, thereby 
improving cache utilization (see Peled, Col.l lines 60-63, Col.4 lines 13-37, and Fig.3). Because 
the specification has defined an extended block to have multiple entry points and a single exit 
point (see p.2 of Specification), one of ordinary skill in the art would have found it obvious to 
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modify the instruction segments of Patel to allow for multiple entry points and a single exit so 
that redundant code segments could be eliminated from the trace cache and performance could 
be improved. 

64. Regarding claim 1 6, Patel in view of Johnson has taught the method of claim 1 5, but has 
not explicitly taught wherein the instruction segments are extended blocks. 

65. However, Peled has taught trace segments which have multiple entry points and a single 
exit that allow redundant code segments to be eliminated from the trace cache, thereby 
improving cache utilization (see Peled, Col.l lines 60-63, Col.4 lines 13-37, and Fig.3). Because 
the specification has defined an extended block to have multiple entry points and a single exit 
point (see p.2 of Specification), one of ordinary skill in the art would have found it obvious to 
modify the instruction segments of Patel to allow for multiple entry points and a single exit so 
that redundant code segments could be eliminated from the trace cache and performance could 
be improved. 

Response to Arguments 

66. Examiner withdraws the rejections of claims 6-7 under 35 U.S.C. §1 12 in favor of the 
claim amendments. 

67. Examiner withdraws the previous rejections of claims 1-5 under 35 U.S.C. §101 in view 
of Applicants' arguments. However, the Examiner notes that new rejections under 35 U.S.C. 
§101 have been made. 

68. Applicant's arguments filed 1 8 July 2007 have been fully considered but they are not 
persuasive. Applicant's argue in essence on pages 6-9 
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. . .Applicants submit the cited section does not teach or suggest a cache 
comprising a cache line to store an instruction segment further comprising a 
plurality of instructions stored in sequential positions of the cache line in reverse 
program order... Indeed, reverse program order is not discussed at all, even 
generally... Applicants submit the Kyker reference fails to teach or suggest at least 
the above-discussed limitations of claim 1 . . . 
69. This has not been found persuasive. As cited, the claim limitation in question is storing 
an instruction segment in "reverse program order". As explained in the previous rejection and 
copied above, Kyker shows in Figure 2 and explains the in the cited lines, the loop has three 
main instructions and, when the backwards branch is encountered, the program jumps 
backwards, i.e. reverses directions, to the beginning of the loop instead of moving forward in the 
program. As shown in Figure 2, when L H is encountered, previously stored instructions L 2 and 
L 3 are stored again in consecutive lines in the cache, e.g. the trace cache reverses the program 
direction and stores previously stored instructions again in the trace cache when a loop is 
encountered. Applicant's arguments appear to suggest that the phrase "reverse program order" 
has a specific, accepted meaning in the art, but the Examiner could not locate this specific 
meaning within the specification nor in the art. As such, the Examiner read the phrase "reverse 
program order" to mean the program stored in the cache line is in the opposite direction of the 
normal program direction. Kyker meets this limitations since they store in the trace cache 
instructions previously stored in the trace cache when the program direction is reversed by a 
backwards branch from a loop. The program is no longer stored in the trace cache in normal 
program order, i.e. in a forward program direction, because the backwards loop requires the 
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program to branch to an instruction previously stored in the trace cache, thereby forcing the 
reverse program order by making the program direction go backwards to re-execute instructions 
in the loop. 

70. In response to applicant's argument that the references fail to show certain features of 
applicant's invention, it is noted that the features upon which applicant relies (i.e., the specific 
meaning of reverse program order) are not recited in the rejected claim(s). Although the claims 
are interpreted in light of the specification, limitations from the specification are not read into the 
claims. See In re Van Geuns, 988 F.2d 1 181, 26 USPQ2d 1057 (Fed. Cir. 1993). 

Conclusion 

71 . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

a. Ramagopal et al., U.S. Patent Number 6,006,317, has taught restoring cache lines 
in reverse program order. 

72. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Aimee J. Li whose telephone number is (571) 272-4169. The 
examiner can normally be reached on M-T 7:00am-4:30pm. 

73. If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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74. Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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